Search for: All records

Creators/Authors contains: "Devanbu, Premkumar"


  1. Code completions produced by today's large language models (LLMs) offer no formal guarantees. We propose proof-carrying code completions (PC³). In this paradigm, a high-resourced entity (the LLM provided by the server) must provide a code completion together with a proof of a chosen safety property, which can be independently checked by a low-resourced entity (the user). To provide safety proofs without requiring the user to write specifications in formal logic, we statically generate preconditions for all dangerous function calls (i.e., functions that may violate the safety property), which must then be proved by the LLM. To demonstrate the main ideas, we provide a prototype implementation in the program verification language Dafny, and a case study focusing on file system vulnerabilities. Unlike Python code generated by GPT-4, Dafny code generated by PC³ provably avoids a common weakness related to path traversal (CWE-35), using a single generation attempt (k = 1) and a modest number of tokens (3,350). Our tool is available as an open-source repository at https://github.com/DavisPL/PCCC.
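     (A minimal, hypothetical Dafny sketch of this precondition pattern is given after this results list.)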
  2. Well-trained machine-learning models, which leverage large amounts of open-source software data, have now become an interesting approach to automating many software engineering tasks. Several SE tasks have been subject to this approach, with performance gradually improving over the past several years thanks to better models and training methods. More (and more diverse) clean, labeled data is better for training; but constructing good-quality datasets is time-consuming and challenging. Ways of augmenting the volume and diversity of clean, labeled data therefore have wide applicability. For some languages (e.g., Ruby) labeled data is less abundant; in others (e.g., JavaScript) the available data may be more focused on some application domains, and thus less diverse. As a way around such data bottlenecks, we present evidence suggesting that human-written code in different languages (which performs the same function) is rather similar, and in particular preserves identifier naming patterns; we further present evidence suggesting that identifiers are a very important element of training data for software engineering tasks. We leverage this rather fortuitous phenomenon to find evidence that available multilingual training data (across different languages) can be used to amplify performance. We study this for three different tasks: code summarization, code retrieval, and function naming. We note that this data-augmenting approach is broadly compatible with different tasks, languages, and machine-learning models.
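
To make the idea in abstract 1 concrete, here is a minimal Dafny sketch. It is not taken from the paper or the DavisPL/PCCC repository; the names SafePath, ReadFile, and ServeRequest are hypothetical. It illustrates the pattern the abstract describes: a dangerous file-access method carries a statically supplied precondition that rules out ".." path segments (the CWE-35 weakness), and a generated completion is accepted only if the Dafny verifier can discharge that precondition at the call site.

    // A path, given as a sequence of segments, is safe if it never steps upward.
    predicate SafePath(path: seq<string>)
    {
      forall i :: 0 <= i < |path| ==> path[i] != ".."
    }

    // The "dangerous" call: its precondition must hold before any completion
    // that invokes it can verify. Actual file-system access is elided here.
    method ReadFile(path: seq<string>) returns (contents: string)
      requires SafePath(path)
    {
      contents := "";
    }

    // A completion in this style must establish SafePath(p) before the call;
    // here the guard mirrors the predicate body, so verification succeeds.
    method ServeRequest(p: seq<string>) returns (data: string)
    {
      if (forall i :: 0 <= i < |p| ==> p[i] != "..") {
        data := ReadFile(p);
      } else {
        data := "request rejected";
      }
    }

In this sketch the precondition plays the role of the statically generated safety obligation: a completion that passed an unchecked path to ReadFile would simply fail to verify, rather than failing at run time.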